Structural Phrase Alignment Based on Consistency Criteria

نویسندگان

  • Toshiaki Nakazawa
  • Yu Kun
  • Sadao Kurohashi
چکیده

In this paper, we propose a new method for phrase alignment using a dependency type distance and a distance-score function. With this method, appropriate correspondences can be selected among correspondence candidates that often include ambiguous or incorrect ones. Furthermore, this method makes it possible to measure the overall alignment consistency. We conduct an alignment experiment using 500 parallel sentences on newspaper domain, and achieve an F-measure improvement of 35 points over the simple statistical method (GIZA++), and 3.0 points over a baseline system. We also conducted a translation experiment and achieved a BLEU score improvement of 0.4 points over a baseline system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality

We present a novel approach to improve word alignment for statistical machine translation (SMT). Conventional word alignment methods allow discontinuous alignment, meaning that a source (or target) word links to several target (or source) words whose positions are discontinuous. However, we cannot extract phrase pairs from this kind of alignments as they break the alignment consistency constrai...

متن کامل

Kyoto-U: Syntactical EBMT System for NTCIR-7 Patent Translation Task

This paper describes “Kyoto-U” MT system that attended the patent translation task at NTCIR-7. Example-based machine translation is applied in this system to integrate our study on both structural NLP and machine translation. In the alignment step, consistency criteria are applied to solve the alignment ambiguities and to discard incorrect alignment candidates. In the translation step, translat...

متن کامل

Statistical Phrase Alignment Model Using Dependency Relation Probability

When aligning very different language pairs, the most important needs are the use of structural information and the capability of generating one-to-many or many-to-many correspondences. In this paper, we propose a novel phrase alignment method which models word or phrase dependency relations in dependency tree structures of source and target languages. The dependency relation model is a kind of...

متن کامل

An iterative refinement algorithm for consistency based multiple structural alignment methods

MOTIVATION Multiple STructural Alignment (MSTA) provides valuable information for solving problems such as fold recognition. The consistency-based approach tries to find conflict-free subsets of alignments from a pre-computed all-to-all Pairwise Alignment Library (PAL). If large proportions of conflicts exist in the library, consistency can be hard to get. On the other hand, multiple structural...

متن کامل

Discriminative Phrase-based Lexicalized Reordering Models using Weighted Reordering Graphs

Lexicalized reordering models play a central role in phrase-based statistical machine translation systems. Starting from the distance-based reordering model, improvements have been made by considering adjacent words in word-based models, adjacent phrases pairs in phrasebased models, and finally, all phrases pairs in a sentence pair in the reordering graphs. However, reordering graphs treat all ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007